N94- 35554 


TDA Progress Report 42-117 


- * - / 


lo 


May 15, 1994 


Trace-Shortened Reed-Solomon Codes 

R. J. McEliece 

California Institute of Technology, Department of Engineering 

G. Solomon 1 


Reed-Solomon (RS) codes have been part of standard NASA telecommunications 
systems for many years. RS codes are character-oriented error-correcting codes , and 
their principal use in space applications has been as outer codes in concatenated 
coding systems [6, Section 5.4.4]. However , for a given character size , say m bits, 
RS codes are limited to a length of, at most , 2 m . It is known in theory that 
longer character-oriented codes would be superior to RS codes in concatenation 
applications , but until recently no practical class of “long” character-oriented codes 
had been discovered. In 1992 , however, Solomon [4,5], discovered an extensive class 
of such codes, which are now called trace-shortened Reed-Solomon (TSRS) codes. 
In this article, we will continue the study of TSRS codes. Our main result is a 
formula for the dimension of any TSRS code, as a function of its error-correcting 
power. Using this formula, we will give several examples of TSRS codes, some 
of which look very promising as candidate outer codes in high-performance coded 
telecommunications systems. 


I. Construction Summary 

In this section, we will summarize the construction of 
trace-shortened Reed-Solomon (TSRS) codes, and state 
our main result (Theorem 1), which is a formula for the 
binary dimension of an arbitrary TSRS code. In Section II, 
we will give some numerical examples. In Section III, we 
will state and prove a theorem (Theorem 2) that is more 
general than Theorem 1. Finally, in Section IV, we will 
summarize our results and list several open problems. 


1 Independent consultant to the Communications Systems Research 
Section. 


We begin with a summary of TSRS codes, as intro- 
duced in [4,5]. For any length n of the form n = 2™ — 1, 
any desired minimum distance d < n — 1 , and any posi- 
tive integer p < m, there is a TSRS code of length n and 
minimum distance d over the symbol alphabet 
the space of binary (m — p)-tuples. TSRS codes are con- 
structed using properties of the Galois field GF( 2 m ). The 
field GF( 2 m “ /i ) does not come into play in the construc- 
tion, and so TSRS codes are not linear over the symbol 
field GF{ 2 m “ M ). However, they are linear over GF( 2), 
and the symbol-wise cyclic shift of any codeword is also 
a codeword. Our main result (Theorem 2) is a formula 
which allows the easy calculation of the binary dimension 
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of any TSRS code. We give several numerical examples, 
which show that TSRS codes are in some cases “almost 
maximum distance separable (MDS),” even though they 
are much longer than any true MDS code can be. 

We begin with an ordinary Reed-Solomon (RS) code, 
as originally defined [3]. That is, with m and k 0 fixed, we 
consider the set V of polynomials of degree ko — 1 or less 
over the field GF( 2 m ), and for each polynomial P E F^ 0 , 
we define a length n = 2 m — 1 codeword C(P) as follows: 

C(P) = (P(l),P(a), .... P(« n - 1 )) (1) 

where a is a primitive nth root of unity in GF( 2 m ). The 
set of all such codewords, i.e., {C(P) : P E P^ 0 }, is an 
(n,ko,d) RS code over GF( 2 m ), where n = 2 m — 1 and 
d = n — A’o + 1 . 

The words in the above-constructed RS codes are vec- 
tors of length n over the field GF( 2 m ). The next step 
in our construction is to “expand” each codeword into a 
length nm binary vector by representing each codeword 
symbol as a vector of length m over GF( 2). The par- 
ticular coordinate basis we shall use for this represen- 
tation is the basis which is dual to the “natural” basis 
{l,a, . . . , c* m-1 }. With respect to this dual basis, which 
we denote by {/?o, /?i, . . . , /? m _i }, an element x E GF( 2 m ) 

has representation x = x 0 A) H h where the 

components z*, which are elements of GF(2), are given by 
the formula 


X h = Tr?(xa h ) for h = 0, 1, . . . , m - 1 (2) 

In Eq. (2), and throughout the article, we use the symbol 
“TV” to denote the trace operator. The super- and sub- 
scripts denote the subfields involved. Thus Tr™(£)» the 
trace of £ from GF( 2 m ) to GF(2 1 ), represents the GF{ 2)- 
linear mapping from GF( 2 m ) to GF( 2 1 ), given by 


basis representation, which are summarized in [2, Chap- 
ter 8]. 


In any event, we call this “expanded” code the bit- 
mapped version of the RS code. 


Now we can define the trace-shortened Reed-Solomon 
codes. Given the bit-mapped RS code defined above, for 
any integer /i in the range 0 < // < m, we define the TSRS 
code of index // to be the set of bit-mapped RS codewords 
for which the first /i binary components of each codeword 
symbol are zero. That is, if C - (C 0 , C n _i) is a 
codeword in the original RS code, C is in the corresponding 
TSRS code of index jj, if and only if 


Tr^e^C.) = 0 


{ 


for h = 0,1, 
for 2 = 0, 1, 


fi - 1 

n — 1 


(5) 


If we delete the js guaranteed-zero binary components from 
each bit-mapped RS symbol, the TSRS code defined by 
Eq. (5) becomes a code of length n whose codeword com- 
ponents lie in the vector space V(2 m ~ of binary (m — //)- 
tuples. We note that while t / (2 m_/J ) is not a field, it is 
nevertheless a group, viz, the elementary abelian group of 
order 2 m-/i [1, Section 3.3]. Since this code is, by def- 
inition, a subcode of the parent RS code, its minimum 
distance is at least d. The bitwise sum of any two code- 
words satisfying Eq. (5) also satisfies Eq. (5), and so the 
code is a group code over the elementary abelian group 
V (2 m— ^). Also, any (symbolwise) cyclic shift of any code- 
word in the TSRS code is also a codeword, so the code 
is a cyclic group code. The determination of the binary 
dimension of the code is somewhat challenging, and is the 
main result of this article. In the next paragraph, we will 
discuss our formula for this dimension. 


Tr?(i)=t + e + --+e' n ~ l (3) 

Similarly, if d is a divisor of m, Tr™(£) denotes the trace of 
£ from GF( 2 d ) to GF( 2 m ), defined as the GF(2 d )-linear 
mapping from GF(2 m ) to GF(2 d ), given by 

Tr?(o=z+e d +e 3 i +--+e u ~ l)d w 

where / = m/d. Throughout the article, we will freely 
use the basic properties of the trace operator and the dual 


As we have seen, the TSRS code of index fi over 
GF(2 m ) is a subgroup of the group V{2 rn ~^ i ) n . The order 
of the code, i.e., the number of codewords, need not be a 
power of 2 m-/i , but since the code is linear over GF(2), 
it must be a power of 2, say 2 Kft . The following theorem 
gives a simple way to calculate . Before stating it, how- 
ever, we need to introduce so-called cyclotomic cosets of 
the field GF(2 m ), which are the cycles of the permuta- 
tion i — ► 2 i mod (2 m — 1) on the set {0, 1, . . . , 2 m — 2} [2, 
Chapter 7]. We shall denote the size of the jth cyclotomic 
coset by dj. For example, if m = 4, there are five such 
cyclotomic cosets: 


120 



J Cj dj 

0 ( 0 ) 1 

1 (1,2,4, 8) 4 

3 (3,6,12,9) 4 

5 (5,10) 2 

7 (7,14,13,11) 4 

Theorem 1. Denote by ej the number of integers in 
the set {0,1, . . . , ko — 1} which lie in the jth cyclotomic 
coset of GF(2 m ). Then the binary dimension of the index 
fi trace-shortened (n,&o) RS code is given by the formula 

max (me j — fidj , 0 ) ( 6 ) 

j 

In Section II, we will give two extended numerical ex- 
amples of TSRS codes. In Section III, we will give our 
proof of Theorem 1. Finally in Section IV, we will make 
some concluding remarks, and list several open problems 
about TSRS codes. 


II. Examples 

For our first example, we begin with the (15,9,7) RS 
code over the field GF(16). Here m = 4, n = 15, ko = 9. 
In the table below, we list the cyclotomic cosets of GF( 16), 
together with the numbers dj and ej, which are needed to 
apply Theorem 1. (For clarity, we list the numbers in the 
set {0, 1, . . . , 8} in boldface.) 

j Cj dj ej 

0 ( 0 ) 11 

1 ( 1 , 2 , 4, 8) 4 4 

3 (3,6,12,9) 4 2 

5 (5,10) 2 1 

7 (7,14,13,11) 4 1 

From this table and using Eq. (6), the binary dimension 
is given by the formula 

K? = (4 - fi)+ + (16 - 4/i) + + (8-4 n) + 

+ (4 — 2/i) + + (4 — 4/i) + 

where x+ is short for max(x,0). Thus we have 


/f 0 = 4+16 + 8 + 4 + 4 = 36 

K x = 3 + 12 + 4 + 2 + 0 = 21 

A 2 = 2 + 8 + 0 + 0 + 0 = 10 

I< 3 = 1 + 4 + 0 + 0 + 0= 5 


In other words, by “trace-shortening” the parent (15,9,7) 
RS code, we obtain the codes over V(8), V(4), and V(2), 
as shown in the table below. (In the following table, we 
extend the usual (n,Ar,d) notation for linear codes to the 
nonlinear TSRS codes by letting k denote the “pseudodi- 
mension” of the code, defined as k ^ = K^/(m — //).) 

Code parameters Symbol group 

(15.9.7) V (16) 

(15.7.7) V(S) 

(15.5.7) V(4) 

(15,5,7) V(2) 


Since the Singleton bound implies that any code with n = 
15 and d = 7 must have k < 9, it follows that the (15, 7, 7) 
code over V(8) is close to optimal. The (15,5,7) code 
over V(4) is not as good as the (15, 6, 7) generalized Bose- 
Chaudhari-Hocquenghem (BCH) code, but we could have 
obtained a code with the same parameters by starting with 
an alternate RS code. Finally, the (15, 5, 7) code over V(2) 
has the same parameters as the (15,5,7) BCH code. 

As our second example, we start with the (31, 27, 5) RS 
code over GF( 32). The distribution of the numbers in the 
set {0, 1, . . . , 26} is shown in the following table: 

j Cj dj ej 

0 ( 0 ) 11 

I ( 1 , 2 , 4 , 8 , 16 ) 5 5 

3 ( 3 , 6 , 12 , 24 , 17 ) 5 5 

5 ( 5 , 10 , 20 , 9 , 18 ) 5 5 

7 ( 7 , 14 , 28 , 25 , 19 ) 5 4 

II ( 11 , 22 , 13 , 26 , 21 ) 5 5 

15 ( 15 , 30 , 29 , 27 , 23 ) 5 2 


Thus we have from Theorem 1, 
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A'o = 5 + 25 + 25 + 25 + 20 + 25 + 10 = 135 

K\ = 4 + 20 + 20 + 20 + 15 + 20 + 5 = 104 

A' 2 = 3 + 15 + 15 + 15 + 10 + 15 + 0 = 73 

A 3 = 2+ 10+ 10+ 10+ 5+10+ 0 = 47 

A 4 = 1+ 5+ 5+ 5+ 0+ 5+ 0 = 21 

This leads to the following list of codes: 

Code parameters Symbol group 


(31.27.5) V(32) 

(31.26.5) V(16) 

(31.24.33.5) V(8) 

(31.23.5.5) V (4) 

(31.21.5) V(2) 

This list contains some interesting codes. In particular, 
the (31,26,5) code over V" (16) is optimal, and in fact is 
“nearly” MDS, despite the fact that it is twice as long as 
any MDS code over a 16-symbol alphabet. 

Let us conclude this section with a few bigger exam- 
ples, omitting the detailed computations. Beginning with 
a (511, 478, 34) RS code over GF{ 2 9 ), for which the parity- 
check polynomial is h(x) — fJjlfiCl “ a*z), a nd trace- 
shortening it with fi = 1, one obtains a (511,474,34) 
TSRS code over the symbol group V r (2 8 ). Similarly, be- 
ginning with a (511,470,42) RS code over GF( 2 9 ), with 
h(x) — nj=i(l “ c^x), a g a i n taking ji = 1, one obtains a 
(511,465,42) code over V r (2 8 ). Preliminary calculations 
performed by Dr. Fabrizio Pollara of the Communica- 
tions Systems Research Section indicate that when con- 
catenated with the NASA standard (7, 1/2) convolutional 
code, both of these TSRS codes give an overall perfor- 
mance which is superior to that of the standard (255, 223) 
RS code. 


III. Proof of Theorem 1 

In this section we will state and prove a theorem which 
is slightly more general than Theorem 1. 

We begin with the field F — GF( 2 m ), a positive integer 
n which is a divisor of 2 m — 1, and a primitive nth root of 
unity in F, say a. Let J be a subset of {0, 1, . . . , n — 1}, 


with \J\ = Ar 0 . We then define the code Cj to be the 
(n, *r 0 ) cyclic code over F, with check polynomial h(x ) = 
n j€ y(l — a^x). Equivalently, Cj consists of all vectors 
C = (C'o,C , i, C n -i) of the form 

Ci = '%2 e i aiS * = 0,1, .... n-l (7) 

where (cj), for j E J, is an arbitrary set of elements of 
F, indexed by J. For future reference, we denote the 
minimum distance of the code Cj by dj. Note that if 
the elements of J form an arithmetic progression mod- 
ulo n , whose increment is relatively prime to n, then 
dj = n — k o -f 1 by the BCH argument, and the code 
Cj is a (generalized) Reed-Solomon code. For example, if 
n — 2 m — 1 and J — { 0,1, . . . , ko — 1}, the code Cj is the 
same as the RS code defined in Eq. (1). 

We now define the /ith-order trace-shortening of Cj, de- 
noted by Cj i to be the set of codewords C E Cj such that 

Tr7Vo) = o {f "o 0 ;// ;;;; Hi} w 

In words, Cj consists of all codewords of Cj for which 
the first /i binary components (with respect to the ba- 
sis for GF(2 m ) over GF(2), which is dual to the basis 
{l,a, t* m_1 }) of each codeword symbol are zero. If 

we ignore these fin guaranteed-zero components, the code 
Cj becomes a code of length n over the set V(2 m ~ M ) of 
binary (m — //)-tuples. 

We denote the minimum symbol distance of the code 
Cj by dj. Since every codeword in Cj is also a codeword 
in the parent code Cj, the minimum symbol distance of 
Cj cannot be less than that of Cj, i.e., dj > dj. This 
simple bound is all we will have to say about the distance 
properties of the code Cj. Our main results concern the 
size of the code. 

Because the parent code Cj is a linear, cyclic code over 
GF(2 m ), it follows that the code Cj is closed under the 
operations of addition of any two codewords, and takes 
the (symbolwise) cyclic shift of any individual codeword. 
However, since the field GF( 2 m_/i ) does not come into 
play, Cj cannot be a linear code. Nevertheless, we will 
define a pseudodimension for Cj. 

Since, as observed, the sum of any two codewords from 
Cj is another codeword, Cj is a linear code over GF(2). 
Let us denote its GF(2)-dimension by K^. (Thus Kq — 
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Jfc 0 X m, I< m = 0.) Then the pseudodimension of is Theorem 2. The binary dimension K M of the code 

defined as is given by the formula 


k — 

k (j. — 


m — n 


-Ku 


( 9 ) 


K„ = ~ M)+ ( 10 ) 

jG/ 


Thus if \Cj\ denotes the number of codewords in Cj , we 
have that 

\Cj \ ~ 

The main result of this article (Theorem 2, below), 
is the determination of A ^ , and hence also for /j = 
0,1, . .., 77 i . To state the result, we need to define the 
modulo n cyclotomic cosets. 

Let n be an odd positive integer. If i and j are integers 
in the range 0 < i < n — 1, and if 2 s i = j (mod n) for 
some integer s, we say that i and j are conjugate mod- 
ulo n. It is easy to see that conjugation modulo n is an 
equivalence relation on the set {0,1, n — 1}, and so 
the set {0, 1, . . . , n - 1} is partitioned into a number of 
disjoint equivalence classes, which are called the modulo 
n cyclotomic cosets. Alternatively, the cyclotomic coset 
containing j , which we will denote by Tj, can be described 
explicitly as the set {j, 2 j, . . . , 2 d_1 j}, where d is the least 
positive integer such that 2 d j — j (mod n). The integer 
d is called the degree of j, written d = deg(j). In what 
follows, we will denote the cardinality of Tj by dj. It is 
easy to see that every element of Fj has degree dj , and 
that dj is a divisor of n. Finally, we denote by l n the set 
consisting of the smallest integers in each cyclotomic coset. 

Example 1. Let n = 15. A short calculation shows 
that there are five cyclotomic cosets modulo 15; indeed, 
we have /is — {0, 1, 3, 5, 7), and 

T 0 = (0) do = 1 

IT = (1,2, 4, 8) d l =4 

r 3 = (3,6,12,9) d 3 = 4 

r 5 = (5, 10) ds = 2 

r 7 = (7,14,13,11) d 7 = 4 


Thus deg(0) = 1, deg(l) = deg(2) = deg(4) — deg(8) — 4, 
etc. 

Now we can give our formula for the binary dimensions 
of the codes Cj. For each j 6 we define Jj = J fl r ; -, 
and tj = \ Jj\. (Compare to Theorem 1.) 


where (x)+ — max (x,0). 

To prove Theorem 2, we need several Lemmas. If n is a 
divisor of 2 m — 1, let P(x) be a polynomial of degree n — 1 
in the indeterminate x , with coefficients in F = GF( 2 m ): 


P(x) = ST p j xJ ’ (11) 

j = o 

Now define the polynomial V(x) as follows: 

V(x) = Tr™(A(:r)) mod (x n - 1) 

(12) 
j - 0 

where in Eq. (12) it is understood that Tr™(£) = £ + s 2 
-| Ef 2 ”" 1 , as in Eq. (3). 

Example 2. Let m = 4, n — 5. If P{x) = Pq 4- P\X 
+ P 2 x 2 + P 3 x 3 + P 4 X 4 , then 'P(x) = P(x) + P{x ) 2 + P{x) A 
-F P(x) s mod (jc 5 — 1) — 1 x (P 0 + Pq + Pq + Pq) + x 
x (Pi + pi + pt + pi) + x 2 x (Pi + pi + pi + p!) + x 3 
X (Pa + Pi + Pi + Pi) + x 4 x (P 4 + Pi + P, 4 + Pf ). Thus, 

Po = Po + P 0 2 + Po + Po 

Pi = P x + P 3 2 + P 4 4 + Pi 

p 2 = P 2 + P 2 + P 4 + P 4 8 

P 3 = P 3 + P| + P 4 + Pf 

P 4 = Pa + P 2 + Pi + Pf a 

Lemma 1. Let P(x) be a polynomial of degree 
n — 1, as defined in Eq. (11). Then Tr™(P(x)) = 0 for 
all x E {l,a, a 2 , ..., o n-1 } if and only if = 0 for 

j = 0, 1 , . . . , n - 1 . 


123 



(15) 


Proof: Since x n = 1 for all x E {l,a, . a n_1 }, 
it follows from Eq. (12) that Tr^(P(ar)) = 0 for all x E 
{1, a, . . . , a”' 1 } iff V(x) = 0 for all x E {1, a, . . . , a 71 ' 1 }. 
But degP(x) < n — 1, so that V(x) must be identically 
0. □ 

The next lemma gives an explicit formula for the coef- 
ficients Vj of V(x ), 


Tr^O = T >?(TV?(0) 


[ 2 , Theorem 8.2]. Combining Eqs. (14) and (15), we find 
that the coefficient of x 9 in V(x) is in fact the coefficient 
of x 9 in 


Lemma 2. For j E {0, 1, . . . , n — 1}, if d = deg(j), 
then 


Vj = Tr^ 


E p ;' 

ue r,- 


(13) 


Note: For a fixed j, for each g E Tj, there exists a unique 
integer i in the set { 0 , 1 , . . . , d — 1 } such that 2*0 mod n = 
j. The summation in Eq. (13) is to be understood to be 
over all pairs (g, i) such that g E Tj , i E {0, 1, . . . , d- 1}, 
and 2 *g mod n = j. We will observe this same convention 
in summations like that in Eq. (13) in the remainder of 
the article. 


Trf J Tr™ ^ P g x 9 j mod x n — 1 (16) 


Now since each element of Tj has degree d, it follows that 
Tr^P^x 5 ) = x 9 Tr™(P g ) mod x n — 1 for each g E Tj, so 
that Expression (16) becomes 


Tri J2 ?(P,) 


(17) 


Example 3. With n — 15 and j = 10, we have from 
Example 1 Tio — {5,10}, and so Lemma 2 implies that 
V 10 = Tr \{Pi + P w ) = (Pi + P,o) + (Pi + P I0 ) 4 = Pi 
+ P in + Pi + Pf 0 . □ 

Proof of Lemma 2: By definition (Eq. (12)), 


Recalling that Trf (£) = £ T £ 2 -f • • * -f £ 2< * 1 , we see that if 
i is the unique index in the range 0 < i < d — 1 such that 
2*0 — j, the t/th term in the sum in Eq. (17) contributes 
exactly 


V{x) = Tr™(P(z)) mod x n — 1 


n— 1 


= ^ Tr 7 , ( P g x 9 ) mod x n - 1 


9 = 0 


Since the exponents appearing in the expansion of 
Tr™ (P g x 9 ) are exactly those which are modulo n conju- 
gates of 0 , it follows that, in calculating the coefficient of 
x 9 in V(x ), only those indices g which are conjugates of 
j, i.e, elements of Tj , need be considered. In other words, 
the coefficient of x 9 in V(x) equals the coefficient of x 9 in 


Tr?(P 9 f =TrZ(P?) 


to the coefficient of x 9 . But this is exactly what Eq. (13) 
says. □ 


Corollary. If j\ and are conjugate modulo n, then 
Vj l and Py> are conjugates in GF(2 m ). More precisely, if 
j has degree d, and if s E {0, 1, . . . , d - 1}, then 


Tr™ j Pg x9 j mod x n — 1 

Wr, 


(14) 


^2'; mod n = Vj 


Now we need to invoke the fact that, if d is a divisor of 
m, then 


Proof: If we use Lemma 2 to compute Vj ' , we find 
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Tr 'J 


E 


9€Tj 

2 * g mod n=j 


Lemma 1, this will be true if and only if Vh,j = 0 f° r all h = 
0, 1, . . . , n - 1 and all j 6 J, where V h ,j is the coefficient 
of in the polynomial Vh{x) = Tr^/V^)) mod (jr n - 1) 
(Eq. (12)). 


= Tr? 

2 1 g mod n 


P l 


• +» 


= Tr 


m 

d 


2* 


E 

^3 


mod n = 2 s 


k 


= TV; D 

Example 4. For m — 4 and n — 5, we have from the 
Corollary, with j = 1 and s = 0, 1,2,3, that 


TV mod 5 = V X = Pi 


?Vmod5 =?2 = T>? 
TVmod5 —Va = V\ 


TV mod 5 — T*3 


T»f 


These relationships can be verified directly by referring to 
Example 2. ^ 

Now. we are prepared to begin the proof of Theorem 2. 
In effect, we wish to count the number of sets { c j)j£j such 
that Eq. (8) holds. If we substitute the formula given in 
Eq. (7) into Eq. (8), we find that ( Cj)j € j defines a word in 
the TSRS code C f if and only if 


Tr™ 



= 0 


f ft = 0,1, P-1 

\ x e {i,a, . . . , a" *} 


(18) 


Now, for h = 0, 1, . . . , p - 1, we define the polynomial 
P h (x) as 


Now by Lemma 2, if d = deg(j), the coefficient Vh,j is 
given by the formula 

TV; =Tr^ | £ cf a h2 J (19) 

where in Eq. (19), Jj = fl J. 

In summary, a set (cj) ; gj of elements from GF(2 m ) 
corresponds to a codeword in Cj if and only if 'Phji as 
defined in Eq. (19), is zero, for all h = 0, 1, * . . , M - 1 and 
all j G J. However, by the Corollary to Lemma 2, conju- 
gate j’s correspond to conjugate Vhf s, and so if Vh,j = 0 
for one element j of a given cyclotomic coset, it will be 
zero for all other elements of the coset as well. Thus in 
“solving” the equations Vh,j = 0, it is sufficient to restrict 
j to lie in the set / n , consisting of the least element of each 
cyclotomic coset. Thus if we want to count the number of 
sets ( Cj)j£j corresponding to codewords in the TSRS code 
C?, we get one set of equations of the form Vhj = 0 for 
fi z= 0,1, . . . , fi — 1 for each modulo n cyclotomic coset, 
i.e., each j G I n ■ 

To simplify the notation, for each g G Jj , where j G / n) 
we define x a = c 2 ' where according to our convention i 
is the unique index such that 2 l g mod n — J. Note that 
since the mapping £ — ► £ 2 is one to one, the c^’s can be 
uniquely recovered from the x g 's. For the remainder of the 
proof, we shall focus on the problem of determining when 
a set (x g ) ge j corresponds to a codeword in Cj. By the 
foregoing discussion and Eq. (19), this will be true if and 
only if 


Tr™ x 9 ah2 j ~ 0 for h = 0, 1, 1 

(20) 


Ph( x ) = ^2 ahc j xJ 

jeJ 

Thus Eq. (18) holds if and only if Tr^^M*)) = 0 for 
all x G {l,a, ..., or” -1 } , for all h = 0,1, . fi - 1. By 


for all j £ I n - Since a set of equations of the form of 
Eq. (20) involves only those variables x g corresponding to 
g's in a fixed cyclotomic coset, it follows that if the number 
of solutions to Eq. (20) is denoted by Nj , then the total 
number of codewords in the code Cj is simply Yij e /„ ^Yj* 
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Let T be one of the cyclotomic cosets modulo n, with 
|T| = d, and let E be a subset of T, with l^l = e. Let 
( x g)geE be e variables taking values in the field GF( 2 m ), 
which satisfy the /i simultaneous linear equations 


We will show that the rank of the mapping T is min(e/, fi) } 
so that the nullity of T is {ef — /i)+, and this will complete 
the proof. We will consider the cases ef > /i and ef < fi 
separately. 


Tr 


m 

d 



= 0 


for h = 0, 1, . . . , fi - 1 (21) 


Case 1: ef > /i. We need to show that rank(T) = //. 
This is equivalent to showing that there is no set of nonzero 
(A/J’s such that 


Theorem 3. The set of solutions {x g ) g ^E to Eq. (21) is 
a vector space over GF{ 2 d ) y of dimension {ef — //)+, where 
/ = m/d. The number of solutions is therefore 

Theorem 3, combined with the previous discussion, 
completes the proof of Theorem 2, since it implies that, 
for each j E / n , the number of solutions to Eq. (20) 
is 2 d jGjfj-»)+ f where fj = m/dj y which is the same as 
2( mej “^ d ^)+, as asserted in Theorem 2. 

Proof of Theorem 3: The fact that the set of so- 
lutions to Eq. (21) is a vector space over GF(2 d ) follows 
from the fact that Tr™ is a linear mapping from GF{ 2 m ) 
to GF(2 d ), i.e, that if x and x ( are elements of GF{ 2 m ), 
and if A and A' are elements of GF(2 d ) } then 

Tr™(Az + A'*) = A Tr?(x) + A' Tr ?(*) (22) 

Using Eq. (22), it is easy to see that if is one solution 
to Eq. (21), and if (x f g ) is another, and if A and A' are 
elements of GF(2 d ), then (Xx g -f X'x' g ) is also a solution to 
Eq. (21). In the remainder of the proof, we will show that 
the GF{ 2 d ) dimension of the solution space to Eq. (21) is 
{ef - /x) + . 

To simplify notation, let q = 2 d , so that 2 m = qf , and 
let r ~ Tr™. If now {x g ) g ^E is an arbitrary vector from 
GF(2 m ) e = GF{qfy, define, for h = 0,1, ...,/i-l, 

Vh = ^ x g Q (23) 

g£E 

and 

Zh = r{y h ) (24) 

Now let T be the GF(<?)-linear mapping from GF(qf) e = 
GF(q) e * to GF{q)^ } defined by 

T : (Zg)geE {Zh)h = 0, ,4-1 


P-1 

El x hZh = o (25) 

h=0 

for all vectors (x g ) in GF(q f ) e . If Eq. (25) is true, then 
from Eq. (24), we have 

p-i \ 

^ ^hUh | = 0 (26) 

h - 0 / 

for all (xg)'’ s. The inner sum in Eq. (26) is, by Eq. (23), 

E y * = E Xh E 

A=0 h—0 g£E 

= E E 

<7 E -E h=0 

It follows then that 


Pg I 0 
<?€£ / 

for all (x g ), where = Y^hZo ^h& h2 ' ■ But it is easy to see 
that this can hold if and only if (3 g — 0 for all g E E. In 
summary, then, Eq. (25) will be true for all (x p )’s if and 
only if 

p-i 

E *ha h2 ' = 0 for all g 6 E (27) 

h- 0 

Next, define the polynomial L(x ) as 

p- 1 

L ( x ) - ^2 Xhxh 
h = 0 
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Then Eq. (27) says L(a 2 ) = 0 for all g £ E, so that L(x) 
has |£j = e distinct roots in the set {a, a 2 , . . . , a 2 }. 
However, since the coefficients of L(x) are in GF{q) ) then 
every GF(q) conjugate of a root of L(x) is also a root. 
But conjugation with respect to GF(q) is the mapping 
/? — + f) q — (3 2 . But since a is a primitive root, each 
element of GF(2 m ) = GF(2 df ) of the form a 2 ‘ with 0 < 
i < d - 1 has exactly / GF(2 d ) conjugates, viz, a 2 for 
k = i, i + d, . . . , i -f d(f - 1). Hence by taking conjugates 
of the original e roots of L(x)> we obtain ef roots. But 
since ef > fi by assumption, and since deg L(x) < fi — 1, 
it follows that the coefficients of L(ar), i.e, the A^’s, are 
all zero. This means that no nontrivial relationship of the 
form of Eq. (25) can hold, which completes the proof in 
Case 1. 

Case 2: ef < fi. In this case we need to show that 
rank(T) = e/. But clearly rank(T) < e/, since the GF(q)- 
dimension of the space of all (ar^J’s is ef. However, by the 
argument in Case 1, the first ef components of ( z g ) are 
linearly independent, and so rank(T) > ef as well. Hence 
rank(T) — ef y as asserted. FS 


IV. Summary and Conclusions 

In this article, we have introduced an extensive class of 
symbol-oriented error-correcting codes, which have prop- 
erties much like those of Reed-Solomon codes, without, 
however, suffering from the major drawback of RS codes, 
viz, an intrinsic limitation on codeword length. As sub- 
codes of RS codes, these codes can be decoded by any RS 
decoding algorithm. However, the study of these codes is 


in its infancy, and we therefore close with a list of unsolved 
problems related to TSRS codes. 

(1) Our selection of the representation of an element 
from GF(2 m ) as a binary m-tuple was more or less 
arbitrary. If another representation is used, will a 
code of larger dimension result? 

(2) Devise an efficient encoding algorithm for an arbi- 
trary TSRS code, or at least a large class of them. 

(3) Determine the conditions under which TSRS codes 
are systematic over the symbol alphabet V(2 m ~ fi ). 
(A necessary condition for this to be so is that the 
pseudodimension k ^ be an integer. But in [4], it was 
shown by example that this condition is not suffi- 
cient.) 

(4) Study the combinatorial optimality of TSRS codes. 
For example, devise bounds on the cardinality of a 
(n,k,d) code over a ^-letter alphabet when q is a 
fixed fraction of n, say An, as n — ► oo. 

(5) Investigate the relationship between TSRS codes 
and generalized BCH codes, i.e., BCH codes whose 
symbol field is GF(2 d ) and whose locator field is 
GF( 2 m ), where d is a divisor of m. 

(6) Compare the distance properties of TSRS codes to 
algebraic geometry codes with approximately the 
same values of n, fc, and q. 

(7) Do TSRS codes meet (or exceed) the Gilbert- 
Varshamov bound? 
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